Introduction to Auto Scaling Group (ASG)

Auto Scaling Group

Auto Scaling Group Attributes

It is possible to scale an ASG based on CloudWatch alarms
An alarm monitors a metric (such as Average CPU, or a custom metric)
Metrics such as Average CPU are computed for the overall ASG instances
Based on the alarm:
- We can create scale-out policies (increase the number of instances)
- We can create scale-in policies (decrease the number of instances)

CPUUtilization: Average CPU utilization across your instances
RequestCountPerTarget: to make sure the number of requests per EC2 instances is stable
Average Network In / Out (if you’re application is network bound)
Any custom metric (that you push using CloudWatch)

After a scaling activity happens, you are in the cooldown period (default 300 seconds)
During the cooldown period, the ASG will not launch or terminate additional instances (to allow for metrics to stabilize)
Advice: Use a ready-to-use AMI to reduce configuration time in order to be serving request faster and reduce the cooldown period

High Availability vs Scalability (vertical and horizontal) vs Elasticity vs Agility in the Cloud
Elastic Load Balancers (ELB)
Distribute traffic across backend EC2 instances, can be Multi-AZ
Supports health checks
4 types: Classic (old), Application (HTTP - L7), Network (TCP - L4), Gateway (L3)
Auto Scaling Groups (ASG)
Implement Elasticity for your application, across multiple AZ
Scale EC2 instances based on the demand on your system, replace unhealthy
Integrated with the ELB